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Fig. 4 Molecular surface of the catalytic domain colore „ 
potential (red = negative, blue = positive, white = uncharged). The active 
site is buried within the channel and appears largely negative. In the 
holotoxin, this pocket is partially blocked by the translocation domain 
belt. Figure generated using GRASP 37 . 



standard deviations over the expected where pairs with Z < 2 
are structurally dissimilar ) The top twelve proteins after 
tetanus toxin (Z = 26.3) that are structurally similar to the N- 
terminal sub-domain are proteins known to interact with sug- 
ars (for example, serum amyloid P (lsac-A, Z = 12.8), 
B-glucanase (2ayh, Z = 11.1), sialidase (lkit. Z = 8.9), and 
lectin (lied, Z =8.8)). Perhaps the most notable of these 
twelve are cryia (lciy, Z = 5.2) and insecticidal 3-endotoxin 
(1 die, Z = 4.8) which act by binding glycoproteins and creating 
leakage channels 3 . These toxins have binding domains with 
motifs similar to the N-terminal sub-domain of BoNT/A but 
dramatically different pore-forming domains. A Dali search of 
the C-terminal sub-domain reveals more sugar binding pro- 
teins (basic fibroblast growth factor (Ibfg, Z = 9.1), agglutinin 
(Ijly-A, Z = 9,9)) and the toxin abrin (labr-B. Z = 8.8). Both 
abrin and the related ricin bind their targets through a |3-tre- 
foil binding domain. The appearance of different subsets of 
the same structural motifs in different toxins could suggest a 



mechanism of evolution in which stable functional domains 
are assembled as modular units giving rise to toxicity. 

Pore-formation and translocation domain 

Following ceil surface binding and receptor-mediated endocyto- 
sis of the neurotoxin, an acid-induced conformational change in 
the neurotoxin's translocation domain is believed to allow the 
translocation domain to penetrate the endosome and form a 
pore. The membrane interaction and pore formation is thought 
to facilitate the passage of the catalytic domain across the mem- 
brane into the cytosol. The details of how the translocation 
domain changes conformation at acidic pH to form a pore and 
how it can allow for the passage of a 50,000 M r catalytic domain 
across the endosomai membrane are the least understood 
aspects of the intoxication mechanism. To date, most investiga- 
tions of the translocation event in BoNT have assumed a similar- 
ity to other pH dependent ct- helical pore-forming proteins: 
diphtheria toxin, coiicin A, 8-endotoxin, pseudomonas exotoxin, 
and Bd-x L 5 - 13 . These proteins share a common structural motif 
in their pore forming domain 531 *, however this motif is not 
observed in the BoNT/A structure. 

The translocation domain of BoNT/A wraps around the cat- 
alytic domain before forming its main body, a cylindrical shape 
with dimensions of 28 A x 32 A x 1 05 A (Fig. 2) . The most salient 
feature of the translocation domain is a pair of a helices 105 A 
long corresponding to residues 685-827. While unusual, long 
pairs of a- helices have been observed recently in the structures of 
coiicin la 5 5 and the nucleotide exchange factor GrpE 16 , The 
helices, anti -parallel and amphipathic, twist around each other 
like a coiled coil but do not adhere to a strict: heptad repeat. At 
both ends of this pair of helices, a shorter a helix packs in paral- 
lel to the long helical axis. In addition, the domain has two 
strand- like sections which pack against the pair of a-helices in a 
parallel fashion. In an effort to identify the pore-forming seg- 
ment of BoNT/A. the primary sequence was searched for pre- 
dicted amphipathic helicity. and the candidate segment (residues 
659-681) was shown to increase permeability of lipid biiayers' 7 . 
The present structure, solved at neutral pH, shows that none of 
this putative transmembrane segment is helical and that, in fact, 
part of it appears to be in one of the two strand -like segments 
packing against the long a-helices. This could indicate an area 
that will undergo structural changes with pH. However, the 



Fig. S Stereo diagram of the catalytic domain 
active site. A cr A -weighted f m ■ f ak omit map 
contoured at 2.6c and centered around the zinc 
shows the amino acids (green carbon, red oxy- 
gen, and blue nitrogen bonds) thought to be 
critical for full catalytic activity. Figure generat- 
ed using BOBSCRIPT. 
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Table 1 Structure determination statistics 

Diffraction data statistics 



Crystal 


Resolution (A) 


Total obs. 


Unique refi 


Completeness 1 


(%) 


R™rg. 2 (%) 


Native 


20-3.3 


172,169 


40,110 


98.2(94.3) 




11.1(48.3) 


CH 3 HgCl 


15-4.5 


87,457 


15,830 


97.2(96.0) 




10.7(27.7) 


Hg 2 (Ac) 2 


15-4.2 


86,260 


19,650 


97.8(94.3) 




9.1(24.5) 


SmAc 


15-4.5 


22,797 


12,516 


78.5(78.2) 




7.6(17.4) 


UAc 


15-4.5 


22.795 


11,796 


71.9(73.6) 




7.8(22.5) 


KAuC! 4 


15-6.0 

tistics 


19,894 


6,080 


93.7(95.5) 




9.4(18.2) 


Phasing stt 

Derivative 


Number 


Isomorphous 


Phasing power 5 








of sites 3 


R- factor 4 (%) 


Cent. 


Acent. 


Cent. 


Acent. 


CH 3 HgCI 


6 


21.7 


1.03 


1.64 


0.68 


0.65 


Hg 2 (Ac) 2 


6 


21.0 


0.96 


1.36 


0.76 


0.76 


SmAc 


3 


14.9 


0.77 


1.07 


0.83 


0.87 


UAc 


3 


15.2 


0.66 


1,04 




0.91 


KAuCU 


1 


15.2 


0.53 


1.03 


0.88 


0.88 


FOM before 


solvent flattening (1 5-3.6 A) 


0.48 










FOM after solvent flattening (1 5-3 6 A) 
Refinement statistics 


0.74 










R-factor/R fre8 


20.0/27.9 










Rm.s.d. bon 


d lengths 


0.008 A 











R.m.s.d. bond angles 1.5' 



'Numbers in parentheses indicate statistics for highest resolution shells. 

2 R m «rg. = ZhS I Im - <Ihi> I / ! In |. where h specifies unique indices, i indicates symmetry equivalent observations of h, and <I M > is the mean value. 
3 The mercury and gold derivatives shared overlapping sites. Similarly, the samarium and uranyl derivatives shared sites. However, the different 
occupancies yielded enough new phase information to improve the quality of the maps. 

♦The isomorphous R-factor = _„ j| f m j - F, | / | F, |, where | F m | and | F, | are the measured structure factor amplitudes of the derivative and native 

Phasing power is the mean value of the heavy atom structure factor amplitude divided by the residual lack of closure error. 
6 Rcuiii5 is the mean residual lack of closure error divided by the isomorphous difference. 



residues most likely to titrate over this pH range are the two his- 
tidines in the translocation domain, which are located some dis- 
tance from this region of the structure: His 551 and His 560 are 
found in a loop between the translocation domain belt and the 
main body of the translocation domain. Thisjunction may play a 
role in exposing a hydrophobic segment of the protein or releas- 
ing the catalytic domain from the translocation domain. 
Regardless of the pore-forming segment location, it is clear that 
the translocation domain of BoNT/A is structurally distinct 
from the other pore-forming toxins. In fact, the long pair of 
a- helices with their triple helix bundles at either end bear more 
resemblance to the coiled coil viral proteins: HIV-1 gp41/GCN4, 
influenza hemagglutinin, and the MoMuLV TM fragment 18-20 . 
These proteins do not translocate through pores but do have an 
acid-induced ability to undergo structural changes and penetrate 
membranes. 

Catalytic domain 

Either within the acidic endosome or upon exposure to the 
cytosol following translocation, the disulfide bond connecting 
the catalytic and translocation domains (Cys 429-Cys 453) is 
reduced, and the catalytic domain is released into the cytoplasm. 
The catalytic domain. 55 Ax 55 Ax 62 A, is a mixture of both ct- 
helix and (3-strand secondary structure, in agreement with sec- 
ondary structure predictions 21 . The active site of the catalytic 
domain is buried 20-24 A deep in the protein, has a negative sur- 
face charge, and is accessible by a channel, -12 A x 15 A x 35 A, 
(Fig. 4). In the di-chain holotoxin, this channel is partially 
shielded from solvent by both the belt and the main body of the 
translocation domain. 



The final mechanistic step in toxicity is the cleavage of a pre- 
synaptic protein by the BoNT zinc protease catalytic domain. 
The catalytic zinc atom represents the highest peak in the elec- 
tron density maps and is visible in MIR maps contoured at 7.5o\ 
Amino acids with side chains closest to the zinc include His 222, 
Glu 223, His 226, Glu 261 and Tyr 365 (Fig. 5). While the pres- 
ence of His 222, Glu 223 and His 226 was anticipated 1 , this struc- 
ture identifies Glu 261 as the fourth ligand, and may help resolve 
uncertainty in tetanus toxin where either Glu 269 or Glu 270 had 
been implicated 22 . (Glu 269 and Glu 270 of tetanus toxin align to 
Glu 260 and Glu 261 of BoNT/A). Further, while the role of a 
tyrosine was anticipated 23 24 , the structure indicates that it is the 
conserved Tyr 365 and not the conserved Tyr 232 which is within 
proximity to the zinc. While exact bond lengths and water mole- 
cules cannot be confirmed at this resolution, the observed orien- 
tation of these residues support a model in which the His 222, 
His 226 and Glu 261 directly coordinate the zinc, and Glu 223 
coordinates a water molecule as the fourth ligand. The Tyr 365 
residue is -5 A from the zinc (OH-Zn) and is more likely to be 
involved in secondary bonding networks or interaction with 
substrate. A Dali search 8 shows that the proteins with the most 
structural similarity to the catalytic domain are thermolysin 
(lhyt. Z = 4.6) and leishmanolysin (11ml. Z = 2.4). two other 
zinc proteases with the same conserved HEXXH sequence. The 
structural similarities are limited, however, to the helix contain- 
ing the HEXXH sequence and a four-stranded (5-sheet buttress- 
ing the helix. Beyond this overlap, the BoNT/A catalytic domain 
has different secondary structure elements and connectivities. 

The catalytic domain of BoNT/A is highly specific for the 
C terminus of SNAP- 25 and appears to require a minimum of 
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16 substrate amino acids for cleavage 25 . This large substrate size 
requirement is unusual for metalloproteases. The molecular 
recognition properties between the toxin and SNAP-25 and the 
dependence of recognition on a specific substrate secondary 
structure is presently being investigated 2627 . The channel 
(-12 A x 15 A x 35 A) leading to the active site (Fig. 4) appears 
capable of accommodating 16 or more residues with the 
Gln-Arg cleavage site of SNAP-25 able to insert into the zinc cat- 
alytic site. Additionally, the high pi of SNAP-25 s C- terminal tail 
suggests that the negatively charged surface observed near the 
active site could be critical in the docking of substrate. 
Interestingly, the active site is most accessible in the absence of 
the translocation domain. This structure, in which the transloca- 
tion domain shields the active site in the unreduced holotoxin, 
explains the fact that the catalytic activity in in vitro experiments 
is greatly enhanced by reduction of the disulfide. This observa- 
tion of an occluded active site, along with the characterization of 
the amino acid environment, will almost certainly benefit ongo- 
ing inhibitor design. 

Methods 

Crystals were grown as described", but with a modified mother 
iiquor of 150 mM magnesium acetate, 11% PEG 4000, and 100 mM 
tris-HCI pH 7. The crystals contain one monomer per asymmetric 
unit and are in space group P3,21 (a = b = 170.5 A, c » 161.1 A) with 
a solvent content of 70%. Derivatives were prepared by soaking 
crystals in: 1 mM potassium tetrachloroaurate for 8-1 2 h, 1 mM mer- 
curous acetate for 6 h, 3 mM methyl mercuric chloride for 6 h, 
50 mM samarium acetate for 12 h, or 0.5 mM uranyl acetate for 
6-10 h, Diffraction data for native and denvatized crystals were col- 
lected at 4 °C at SSRL, beamline 7-1 and bearnline 9-1 , at 1.08 A and 
0.98 A respectively. Crystals diffracted reasonably well for about five 



images using 1° oscillations before showing signs of radiation dam- 
age. Taking advantage of the high symmetry and the ability to 
translate bar shaped crystals allowed for complete data sets to be 
collected from a few crystals. All data were processed with DENZO 
and SCALEPACK", allowing for decreased resolution and increased 
mosaicity as the crystal decayed. The CCP4 suite of programs 30 was 
used for scaling the data sets and locating the heavy atom positions. 
Patterson maps were viewed with XCONTUR 31 . MLPHARE 30 was used 
for heavy-atom position refinement and phase calculation. Phases 
were extended to 3.6 A and improved by solvent flattening and his- 
togram matching using the program DM 32 . The model was built to 
3.6 A resolution using the program O 33 , and then, as more data was 
collected, extended to 3.3 A. Maps were improved with phase com- 
bination using SIGMAA 3 *. Cycles of rebuilding, positional refine- 
ment, and simulated annealing using X-PLOR 35 were continued until 
convergence. Excellent electron density exists throughout the 
model and 99% of the amino acids were visible. Not surprisingly, the 
two ill-defined regions of the structure are the site of proteolytic 
cleavage between the two chains and in a surface loop immediately 
following His 560 of the translocation domain. 
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